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SEQUENCE LISTING 

(1) GENERAL INFORMATION; 

(i) APPLICANT: Washington University 
(ii) TITLE OF INVENTION: BLNK PROTEINS 
(iii) NUMBER OF SEQUENCES: 12 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

(B) STREET: Four Embarcadero Center, Suite 3400 

(C) CITY: San Francisco 
15 (D) STATE: California 

(E) COUNTRY: United States 

(F) ZIP: 94111-4187 

(v) COMPUTER READABLE FORM: 
20 (A) MEDIUM TYPE: Floppy disk 

i3 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM; PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 



25 <vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US98/ 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

30 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/819,013 

(B) FILING DATE: 17 -MAR- 1997 

(vii) PRIOR APPLICATION DATA: 
35 (A) APPLICATION NUMBER: US 08/788,322 

<B) FILING DATE: 24-JAN-1997 

(Viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Silva, Robin M. 
40 (B) REGISTRATION NUMBER: 38,304 

(C) REFERENCE /DOCKET NUMBER: FP- 64383 -2/RFT/RMS 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (415) 781-1989 
45 (B) TELEFAX: (415) 398-3249 

(C) TELEX: 910 277299 



(2) INFORMATION FOR SEQ ID NO : 1 : 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
55 (D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Asp Lys Leu Asn Lys lie Thr Val Pro Ala Ser Gin Lys Leu Arg 
15 10 15 

Gin Leu Gin Lys Met Val His Asp lie Lys Asn Asn Glu Gly Gly lie 

20 25 30 

Met Asn Lys lie Lys Lys Leu Lys Val Lys Ala Pro Pro Ser Val Pro 
35 40 45 

Arg Arg Asp Tyr Ala Ser Glu Ser Pro Ala Asp Glu Glu Glu Gin Trp 
50 55 60 

Ser Asp Asp Phe Asp Ser Asp Tyr Glu Asn Pro Asp Glu His Ser Asp 
65 70 75 80 

Ser Glu Met Tyr Val Met Pro Ala Glu Glu Asn Ala Asp Asp Ser Tyr 

85 90 95 

Glu Pro Pro Pro Val Glu Gin Glu Thr Arg Pro Val His Pro Ala Leu 

100 105 110 

Pro Phe Ala Arg Gly Glu Tyr lie Asp Asn Arg Ser Ser Gin Arg His 
115 120 125 

Ser Pro Pro Phe Ser Lys Thr Leu Pro Ser Lys Pro Ser Trp Pro Ser 
130 135 140 

Glu Lys Ala Arg Leu Thr Ser Thr Leu Pro Ala Leu Thr Ala Leu Gin 
145 150 155 160 

Lys Pro Gin Val Pro Pro Lys Pro Lys Gly Leu Leu Glu Asp Glu Ala 

165 170 175 

Asp Tyr Val Val Pro Val Glu Asp Asn Asp Glu Asn Tyr lie His Pro 

180 185 190 

Thr Glu Ser Ser Ser Pro Pro Pro Glu Lys Ala Pro Met Val Asn Arg 
195 200 205 

Ser Thr Lys Pro Asn Ser Ser Thr Pro Ala Ser Pro Pro Gly Thr Ala 
210 215 220 

Ser Gly Arg Asn Ser Gly Ala Trp Glu Thr Lys Ser Pro Pro Pro Ala 
225 230 235 240 

Ala Pro Ser Pro Leu Pro Arg Ala Gly Lys Lys Pro Thr Thr Pro Leu 

245 250 255 



Lys Thr Thr Pro Val Ala Ser Gin Gin Asn Ala Ser Ser Val Cys Glu 

260 265 270 
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Glu Lys Pro He Pro Ala Glu Arg His Arg Gly Ser Ser His Arg Gin 
275 280 285 

Glu Ala Val Gin Ser Pro Val Phe Pro Pro Ala Gin Lys Gin lie His 
290 295 300 

Gin Lys Pro He Pro Leu Pro Arg Phe Thr Glu Gly Gly Asn Pro Thr 
305 310 315 320 

Val Asp Gly Pro Leu Pro He Phe Ser Ser Asn Ser Thr He Ser Glu 

325 330 335 

Gin Glu Ala Gly Val Leu Cys Lys Pro Trp Tyr Ala Gly Ala Cys Asp 

340 345 350 

Arg Lys Ser Ala Glu Glu Ala Leu His Arg Ser Asn Lys Asp Gly Ser 
355 360 365 

Phe Leu lie Arg Lys Ser Ser Gly His Asp Ser Lys Gin Pro Tyr Thr 
370 375 380 

Leu val Val Phe Phe Asn Lys Arg Val Tyr Asn He Pro Val Arg Phe 
385 390 395 400 

He Glu Ala Thr Lys Gin Tyr Ala Leu Gly Arg Lys Lys Asn Gly Glu 

405 410 415 

Glu Tyr Phe Gly Ser Val Ala Glu He He Arg Asn His Gin His Ser 

420 425 430 

Pro Leu Val Leu He Asp Ser Gin Asn Asn Thr Lys Asp Ser Thr Arg 
435 440 445 

Leu Lys Tyr Ala Val Lys Val Ser 
450 455 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1806 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

CCTTCGTGGC CGCAGCCTGC ACTCTCAGAA ATCAGACTTG AGTGGCCGGA ACCCTTGAGA 60 

CCAGAGGCTT ACCATGCTGC TCC CTAGGAG GGCCAGGAAC TGCTGACGTG ACCACTGGAC 12 0 

AGTTATTCGT GTCTCTTACA ATTACCAAAC AGAATGGACA AGCTTAATAA AATAACCGTC 18 0 

CCCGCCAGTC AGAAGTTGAG GCAGCTTCAA AAGATGGTCC ATGATATTAA AAACAATGAA 24 0 
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GGTGGAATAA TGAATAAAAT CAAAAAGCTA AAAGTCAAAG CACCTCCAAG TGTTCCTCGA 300 

AGGGACTACG CTTCAGAGAG CCCCGCTGAC GAAGAGGAGC AGTGGTCCGA TGACTTTGAC 36 0 

5 AGCGACTATG AAAATCCAGA TGAGCACTCG GACTCAGAGA TGTACGTGAT GCCCGCCGAG 42 0 

GAGAACGCTG ATGACAGCTA CGAGCCGCCT CCAGTAGAGC AGGAAACCAG GCCGGTTCAC 480 

CCAGCCCTGC CCTTCGCCAG AGGCGAGTAT ATAGACAATC GATCAAGCCA GAGGCATTCC 54 0 

CCACCCTTCA GCAAGACACT TCCCAGTAAG CCCAGCTGGC CTTCAGAGAA AGCAAGGCTC 6 00 

ACCTCCACCC TGCCGGCCCT GACTGCTTTG CAGAAACCTC AAGTCCCACC CAAACCCAAA 660 

15 GGCCTCCTTG AGGATGAGGC TGATTATGTG GTCCCCGTGG AAGATAATGA TGAAAACTAT 720 

ATTCATCCCA CAGAAAGCAG TTCACCTCCA CCTGAAAAAG CTCCCATGGT GAATAGATCA 78 0 

ACCAAGCCAA ATTCCTCAAC GCCCGCCTCT CCTCCAGGAA CAGCTTCAGG TCGAAACAGT 84 0 

20 

GGGGCCTGGG AAACCAAGTC ACCTCCACCA GCTGCACCAT CCCCGTTGCC ACGGGCCGGG 900 

AAAAAACCAA CGACACCACT GAAGACAACT CCAGTTGCCT CTCAACAGAA TGCTTCAAGT 960 

2 5 GTTTGTGAAG AAAAACCTAT ACCTGCTGAA CGCCACCGAG GGTCAAGTCA CAGACAAGAA 1020 

GCTGTGCAGT CACCAGTGTT TCCTCCTGCC CAGAAACAAA TCCACCAAAA ACCCATACCT 1080 

CTGCCAAGAT TTACAGAAGG GGGAAACCCA ACTGTGGATG GGCCCCTACC CAGCTTTTCA 114 0 

30 

TCTAATTCCA CTATTTCAGA ACAGGAAGCT GGCGTTCTCT GCAAGCCATG GTATGCTGGA 12 00 

GCCTGTGATC GAAAGTCTGC TGAAGAGGCA TTGCACAGAT CAAACAAGGA TGGATCATTT 12 6 0 

3 5 CTTATTCGGA AAAG CTCTGG CCATGATTCC AAACAACCAT ATACACTAGT TGTATTCTTT 132 0 

AATAAGCGAG TATATAATAT TCCTGTGCGA TTTATTGAAG CAACAAAACA ATATGCCTTG 13 80 

GGCAGAAAGA AAAATGGTGA AGAGTACTTT GGAAGTGTTG CTGAAATCAT CAGGAATCAT 1440 

40 

CAACATAGTC CTTTGGTTCT TATTGACAGT CAGAATAACA CAAAAGATTC CACCAGACTG 1500 

AAGTATGCAG TTAAAGTTTC ATAAAGGGGG AAAAAAAAGA TCAATACCAT TGCTTCAGAC 1560 

45 ACTTTCCCAA AGTTTCTCCT TTTGAGAAAA AGTCCCAAAA CTTCATATTT TGGATTATGA 1620 

ATCATCCAGT AATAAAATGG AAGATGGAGT CAGCTATTGA AGTGGTCATC CATTTCTTTT 1680 

TAAGAAGCTC ATGTGGACTT GTTCTATTGC CTGACCTGAT GAACTGTTAA TATCTGGTGA 1740 

50 

GGTTGAGTTA TCATGCTACT AATATTTTCC AAATAAATAT TTTTATTTTT AAAAAAAAAA 1800 

AAAAAA 1806 
55 (2) INFORMATION FOR SEQ ID NO : 3 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
<D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met Asp Lys Leu 
1 

His lie Lys Asn 

20 

Lys Val Lys Ala 
35 



Asn Lys lie Thr 
5 

Asn Glu Gly Gly 



Pro Pro Ser Val 

40 



Val Pro Ala Ser 
10 

lie Met Asn Lys 
25 

Pro Arg Arg Asp 



Gin Lys Leu Arg 
15 

lie Lys Lys Leu 
30 

Tyr Ala Ser Glu 
45 



Ser Pro Ala Asp 
50 

Tyr Glu Asn Pro 
65 

Ala Glu Glu Asn 



Glu Thr Arg Pro 

100 

lie Asp Asn Arg 
115 



Glu Glu Glu Gin 
55 

Asp Glu His Ser 
70 

Ala Asp Asp Ser 
85 

Val His Pro Ala 



Ser Ser Gin Arg 

120 



Trp Ser Asp Asp 

60 

Asp Ser Glu Met 
75 

Tyr Glu Pro Pro 
90 

Leu Pro Phe Ala 
105 

His Ser Pro Pro 



Phe Asp Ser Asp 



Tyr Val Met Pro 

80 

Pro Val Glu Gin 
95 

Arg Gly Glu Tyr 
110 

Phe Ser Lys Thr 
125 



Leu Pro Ser Lys 
130 

Thr Leu Pro Ala 
145 

Pro Lys Gly Leu 



Asp Asn Asp Glu 

180 

Pro Glu Lys Ala 
195 

Thr Pro Ala Ser 
210 



Pro Ser Trp Pro 
135 

Leu Thr Ala Leu 
150 

Leu Glu Asp Glu 
165 

Asn Tyr lie His 



Pro Met Val Asn 

200 

Pro Pro Gly Thr 
215 



Ser Glu Lys Ala 

140 

Gin Lys Pro Gin 
155 

Ala Asp Tyr Val 
170 

Pro Thr Glu Ser 
185 

Arg Ser Thr Lys 



Ala Ser Gly Arg 

220 



Arg Leu Thr Ser 



Val Pro Pro Lys 

160 

Val Pro Val Glu 
175 

Ser Ser Pro Pro 
190 

Pro Asn Ser Ser 
205 

Asn Ser Gly Ala 



Trp Glu Thr Lys Ser Pro Pro Pro Ala Ala Pro Ser Pro Leu Pro Arg 
225 230 235 240 
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Ala Gly Lys Lys Pro Thr Thr Pro Leu Lys Thr Thr Pro Val Ala Ser 

245 250 255 

Gin Gin Asn Ala Ser Ser Val Cys Glu Glu Lys Pro lie Pro Ala Glu 
5 260 265 270 

Arg His Arg Gly Ser Ser His Arg Gin Glu Ala Val Gin Ser Pro Val 
275 280 285 

10 Phe Pro Pro Ala Gin Lys Gin lie His Gin Lys Pro lie Pr*o Leu Pro 

290 295 300 

Arg Phe Thr Glu Gly Gly Asn Pro Thr Val Asp Gly Pro Leu Pro Ser 
305 310 315 320 



15 



30 



50 



Phe Ser Ser Asn Ser Thr lie Ser Glu Gin Glu Ala Gly Val Leu Cys 

325 330 335 



Lys Pro Trp Tyr Ala Gly Ala Cys Asp Arg Lys Ser Ala Glu Glu Ala 
20 340 345 350 

Leu His Arg Ser Asn Lys Asp Gly Ser Phe Leu lie Arg Lys Ser Ser 
355 360 365 

25 Gly His Asp Ser Lys Gin Pro Tyr Thr Leu Val Val Phe Phe Asn Lys 

370 375 380 

Arg Val Tyr Asn lie Pro Val Arg Phe lie Glu Ala Thr Lys Gin Tyr 
385 390 395 400 



Ala Leu Gly Arg Lys Lys Asn Gly Glu Glu Tyr Phe Gly Ser Val Ala 

405 410 415 



Glu lie He Arg Asn His Gin His Ser Pro Leu Val Leu He Asp Ser 
35 420 425 430 

Gin Asn Asn Thr Lys Asp Ser Thr Arg Leu Lys Tyr Ala Val Lys Val 
435 440 445 

40 Ser 



(2) INFORMATION FOR SEQ ID NO : 4 : 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNE S S : unknown 

(D) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



55 CCTTCGTGGC CGCAGCCTGC ACTCT CAGAA ATCAGACTTG AGTGGCCGGA ACCCTTGAGA 6 0 
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CCAGAGGCTT ACCATGCTGC TCCCTAGGAG GGCCAGGAAC TGCTGACGTG AC C ACT GGAC 120 

AGTTATTCGT GTCTCTTACA ATTACCAAAC AGAATGGACA AG CTTAATAA AATAACCGTC 18 0 

5 CCCGCCAGTC AGAAGTTGAG GCATATTAAA AACAATGAAG GTGGAATAAT GAATAAAATC 24 0 

AAAAAGCTAA &AGTCAAAGC AC CTCCAAGT GTTCCTCGAA GGGACTACGC TTCAGAGAGC 3 00 

CCCGCTGACG AAGAGGAGCA GTGGTCCGAT GACTTTGACA GCGACTATGA AAAT CCAGAT 360 

GAGCACTCGG ACTCAGAGAT GTACGTGATG CCCGCCGAGG AGAAC GCTG A TGACAG CTAC 420 

GAGCCGCCTC CAGTAGAGCA GGAAACCAGG CCGGTTCACC CAGCCCTGCC CTTCGCCAGA 480 

15 GGCGAGTATA TAGACAATCG ATCAAGCCAG AGGCATTCCC CACCCTTCAG CAAGACACTT 540 

CCCAGTAAGC CCAGCTGGCC TTCAGAGAAA GCAAGGCTCA CCTCCACCCT GCCGGCCCTG 600 

ACTGCTTTGC AGAAAC CTCA AGTCCCACCC AAACCCAAAG GCCTCCTTGA GGATGAGGCT 660 

20 

GATTATGTGG TCCCCGTGGA AGATAATGAT GAAAACTATA TTCATCCCAC AGAAAGCAGT 720 

TCACCTCCAC CTGAAAAAGC TCCCATGGTG AATAGATCAA CCAAGCCAAA TTCCTCAACG 780 

25 CCCGCCTCTC CTC CAGGAAC AGCTTCAGGT CGAAACAGTG GGGCCTGGGA AACCAAGTCA 840 

CCTCCACCAG CTGCACCATC CCCGTTGCCA CGGGCCGGGA AAAAACCAAC GACACCACTG 900 

AAGACAACTC CAGTTGCCTC TCAACAGAAT GCTTCAAGTG TTTGTGAAGA AAAACCTATA 960 

30 

CCTGCTGAAC GCCACCGAGG GTCAAGTCAC AGACAAGAAG CTGTGCAGTC ACCAGTGTTT 1020 

CCTCCTGCCC AGAAACAAAT CCACCAAAAA CCCATACCTC TGCCAAGATT TACAGAAGGG 1080 

35 GGAAACCCAA CTGTGGATGG GCCCCTACCC AGCTTTTCAT CTAATTCCAC TATTTCAGAA 114 0 

CAGGAAGCTG GCGTTCTCTG CAAGCCATGG TATGCTGGAG CCTGTGATCG AAAGTCTGCT 12 00 

GAAGAGGCAT TGCACAGATC AAACAAGGAT GGATCATTTC TTATTCGGAA AAGCTCTGGC 126 0 

40 

CATGATTCCA AACAACCATA TACACTAGTT GTATTCTTTA ATAAGCGAGT ATATAATATT 132 0 

CCTGTGCGAT TTATTGAAGC AACAAAACAA TATGCCTTGG GCAGAAAGAA AAATGGTGAA 1380 

4 5 GAGTACTTTG GAAGTGTTGC TGAAATCATC AGGAATCATC AACATAGTCC TTTGGTTCTT 1440 

ATTGACAGTC AGAATAACAC AAAAGATTCC ACCAGACTGA AGTATGCAGT TAAAGTTTCA 1500 

TAAAGGGGGA AAAAAAAGAT CAATACCATT GCTTCAGACA CTTTCCCAAA GTTTCTCCTT 156 0 

50 

TTGAGAAAAA GTCCCAAAAC TTCATATTTT GGATTATGAA TCATCCAGTA ATAAAATGGA 162 0 

AGATGGAGTC AGCTATTGAA GTGGTCATCC ATTTCTTTTT AAGAAGCTCA TGTGGACTTG 168 0 

55 TTCTATTGCC TGACCTGATG AACTGTTAAT ATCTGGTGAG GTTGAGTTAT CATGCTACTA 174 0 
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ATATTTTCCA AATAAATATT TTTATTTTTA AAAAAAAAAA AAAAA 1785 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

15 Met Asp Lys Leu Asn Lys lie Thr Val Pro Ala Ser Gin Lys Leu Arg 

15 10 15 

Gin Leu Gin Lys Met Val His Asp lie Lys Asn Asn Glu Gly Gly lie 

20 25 30 



Met Asp Lys lie Lys Lys Leu Lys Val Lys Gly Pro Pro Ser Val Pro 
35 40 45 



Arg Arg Asp Tyr Ala Leu Asp Ser Pro Ala Asp Glu Glu Glu Gin Trp 
25 50 55 60 

Ser Asp Asp Phe Asp Ser Asp Tyr Glu Asn Pro Asp Glu His Ser Asp 
65 70 75 80 

3 0 Ser Glu Met Tyr Val Met Pro Ala Glu Glu Thr Gly Asp Asp Ser Tyr 

85 90 95 

Glu Pro Pro Pro Ala Glu Gin Gin Thr Arg Val Val His Pro Ala Leu 

100 105 110 



Pro Phe Thr Arg Gly Glu Tyr Val Asp Asn Arg Ser Ser Gin Arg His 
115 120 125 



Ser Pro Pro Phe Ser Lys Thr Leu Pro Ser Lys Pro Ser Trp Pro Ser 
40 130 135 " 140 

Ala Lys Ala Arg Leu Ala Ser Thr Leu Pro Ala Pro Asn Ser Leu Gin 
145 150 155 160 

45 Lys Pro Gin Val Pro Pro Lys Pro Lys Asp Leu Leu Glu Asp Glu Ala 

165 170 175 

Asp Tyr Val Val Pro Val Glu Asp Asn Asp Glu Asn Tyr lie His Pro 

180 185 190 

50 

Arg Glu Ser Ser Pro Pro Pro Ala Glu Lys Ala Pro Met Val Asn Arg 
195 200 205 

Ser Thr Lys Pro Asn Ser Ser Ser Lys His Met Ser Pro Pro Gly Thr 
55 210 215 220 
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Val Ala Gly Arg Asn Ser Gly Val Trp Asp Ser Lys Ser Ser Leu Pro 
225 230 235 240 

Ala Ala Pro Ser Pro Leu Pro Arg Ala Gly Lys Lys Pro Ala Thr Pro 
5 245 250 255 

Leu Lys Thr Thr Pro Val Pro Pro Leu Pro Asn Ala Ser Asn Val Cys 

260 265 270 

10 Glu Glu Lys Pro Val Pro Ala Glu Arg His Arg Gly Ser Ser His Arg 

275 280 285 



15 



Gin Asp Thr Val Gin Ser Pro Val Phe Pro Pro Thr Gin Lys Pro Val 
290 295 300 

His Gin Lys Pro Val Pro Leu Pro Arg Phe Pro Glu Ala Gly Ser Pro 
305 310 315 320 

Ala Ala Asp Gly Pro Phe His Ser Phe Pro Phe Asn Leu Thr Phe Ala 
20 325 330 335 

Asp Gin Glu Gly Glu Leu Leu Gly Lys Pro Trp Tyr Ala Gly Ala Cys 

340 345 350 

25 Asp Arg Lys Phe Ala Glu Glu Ala Leu His Arg Ser Asn Lys Asp Gly 

355 360 365 



30 



Ser Phe Leu lie Arg Lys Ser Phe Gly His Asp Ser Lys Gin Pro Tyr 
370 375 380 

Thr Leu Val Ala Phe Phe Asn Lys Arg Val Tyr Asn lie Pro Val Arg 
385 390 395 400 

Phe lie Glu Ala Thr Lys Gin Tyr Ala Leu Gly Lys Lys Lys Asn Gly 
35 405 410 415 

Glu Glu Tyr Phe Gly Ser Val Val Glu lie Val Asn Ser His Gin His 

420 425 430 

40 Asn Pro Leu Val Leu lie Asp Ser Gin Asn Asn Thr Lys Asp Ser Thr 

435 440 445 

Arg Leu Lys Tyr Ala Val Lys Val Ser 
450 455 



45 



55 



(2) INFORMATION FOR SEQ ID NO ; 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1718 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
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CTGTGGGTTG CTCGCAGAAG TCAGTTCCAG 
CATGCTGCTC CCCAGGAAGT CCAGGAGCTG 
5 CTCTCAAGAA AATGGACAAG CTGAATAAGA 
AGCTTCAAAA GATGGTCCAT GATATTAAGA 
AAAAGCTAAA AGTCAAAGGC CCTCCAAGTG 

10 

CTGCAGATGA AGAGGAGCAG TGGTCAGATG 
AACATTCGGA CTCCGAGATG TATGTGATGC 
15 AACCGCCTCC CGC TGAGCAG CAGACACGGG 
GCGAGTATGT AGATAATCGA TCCAGCCAGC 
CCAGTAAGCC CAGCTGGCCT, TCAGCGAAAG 

20 

ACTCTCTACA GAAGCCTCAA GTCCCCCCCA 
ATTATGTGGT CCCTGTGGAA GATAACGATG 
25 CGCCGCCTGC TGAGAAGGCT CCCATGGTGA 
AGCACATGTC GCCTCCAGGG ACTGTCGCAG 
CATCTTTGCC TGCCGCACCA TCCCCACTAC 

30 

TTAAGACTAC TCCCGTTCCT CCCCTACCGA 
TTCCTGCTGA GCGCCACCGA GGGTCTAGTC 

3 5 TTCCTCCCAC CCAGAAACCT GTCCATCAAA 

CGGGGAGCCC AGCTGCAGAT GGACCGTTCC 
ACCAGGAGGG TGAACTGCTC GGTAAGCCCT 

40 

CTGAAGAGGC CTTGCACAGA TCCAACAAGG 
GCCATGATTC CAAGCAGCCG TACACCCTAG 

4 5 TTCCTGTACG GTTTATTGAA GCAACCAAAC 

AAGAGTACTT CGGAAGTGTT GTGGAAATCG 
TTATTGACAG TCAGAATAAC ACGAAAGATT 

50 

CATAACGATA CCACGGTTCC AGACATGTCC 
TCTGGCTATG ACTCCTCAGC AGTAAGAGAG 

5 5 GAATGAATGA ATCTACTCCT TCCTAGGGCG 
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TGGCTTGAGT TCTTGAGGCC AGAGCCTTAC 6 0 

CTGACACCCC CCTGGACAGC GACACATCCT 120 

TAACTGTCCC TGCCAGCCAG AAGCTGAGAC 180 

ACAATGAAGG TGGAATAATG GACAAGATAA 240 

TTCCTCGAAG GGACTATGCA TTAGACAGCC 300 

ACTTCGACAG TGACTATGAA AATCCAGATG 360 

CTGCCGAGGA GACGGGCGAC GATTCCTATG 420 

TGGTCCATCC AGCCCTGCCC TTCACGAGGG 48 0 

GGCACTCTCC GCCCTTCAGC AAGACACTTC 54 0 

CGAGGCTGGC CTCCACTCTG CCAGCCCCCA 6 00 

AGCCCAAAGA CCTCCTTGAG GATGAGG CTG 660 

AAAACTATAT CCATCCCAGA GAAAGTAGCC 720 

ATAGATCAAC CAAGCCAAAC AGTTCCTCAA 780 

GTCGAAACAG TGGGGTCTGG GACTCCAAGT 84 0 

CACGGGCTGG GAAGAAGCCA GCTACACCAC 900 

ATGCATCAAA TGTTTGTGAA GAAAAGCCTG 960 

ACAGACAAGA CACTGTACAG TCACCAGTGT 102 0 

AG C CTGTACC CTTGCCAAGG TTCCCAGAAG 108 0 

ACAGCTTCCC ATTTAATTTG ACGTTTG CAG 1140 

GGTATGCTGG CGCCTGTGAC CGCAAGTTTG 12 00 

ATGGATCGTT TCTTATTCGG AAGAGCTTTG 1260 

TTGCGTTCTT TAACAAGCGA GTGTATAATA 132 0 

AGTATGCTTT GGGAAAGAAG AAAAATGGTG 138 0 

TCAACAGTCA TCAGCACAAC CCCCTGGTTC 144 0 

CCACGAGACT GAAATATGCT GTGAAGGTTT 1500 

TCTGTTTCTT CTTTTGAGAA AACATCATAT 156 0 

AAAAGATGAA TGAAGCCACT GAGGCTTCGT 162 0 

TTCACACGAG CTTTTCTATC ACCTGACCTG 168 0 
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ACGAAGTCAT AGCTGGGGAG GTTCGGTTAC TATGATAC 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Tyr Glu Xaa Pro 
1 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

( C ) S TRANDEDNE S S : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1. .2 

(D) OTHER INFORMATION: /note= "The 'X' appearing at 
positions 1, 8 and 15, represent either Aspartic Acid or 
Glutamic Acid." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Xaa Xaa Tyr Glu Xaa Pro Xaa Xaa Xaa Tyr Xaa Xaa Pro Xaa Xaa Xaa 
15 10 15 

Tyr Glu Pro Pro 

20 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : misc^feature 

(B) LOCATION: 19.. 20 



WO 98/32852 



PCT/US98/01394 



-39- 

(D) OTHER INFORMATION: /note- "The ' N 1 at position 19 
represents Inosine. " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TCGAGAATTC AARAARCCNA CNACNCC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

CO STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Lys Pro Thr Thr Pro Leu Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
{ D ) TO POLOG Y : unknown 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 13 , . 14 

(D) OTHER INFORMATION: /note= "The ' N 1 at position 13 
represents Inosine. " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

■ 

CTGAGGATCC TTNGTNGCYT CDATRAA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Lys Arg Val Tyr Asn lie Pro Val Arg Phe lie Glu Ala Thr Lys 
15 io 15 



